Artificial Neural Network & Mel-Frequency Cepstrum Coefficients-Based Speaker Recognition

نویسندگان

  • Adjoudj Réda
  • Boukelif Aoued
چکیده

Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. This technique makes it possible to use the speaker’s voice to verify their identity and control access to services such as voice dialing, banking by telephone, telephone shopping, database access services, information services, voice mail, security control for confidential information areas, and remote access to computers. This document demonstrates how a speaker recognition system can be designed by artificial neural network using MelFrequency Cepstrum Coefficients of voice signal. Note that the training process did not consist of a single call to a training function. Instead, the network was trained several times on various input ideal and noisy signals coded by MelFrequency Cepstrum Coefficients, the signals which contents voices. In this case training a network on different sets of noisy signals forced the network to learn how to deal with noise, a common problem in the real world.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Throat Microphone for Speaker Recognition Using AANN

In this paper, we have analyzed the performance of speaker recognition system based on features extracted from the speech recorded using throat microphone in clean and noisy environment. In general, clean speech performs better for speaker recognition system. Speaker recognition in noisy environment, using transducer held at the throat results in a signal that is clean even in noisy. This speak...

متن کامل

Significance of formants from difference spectrum for speaker identification

In this paper, we describe a prototype speaker identification system using auto-associative neural network (AANN) and formant features. Our experiments demonstrate that formants extracted from difference spectrum perform significantly better than formants extracted from normal spectrum for the task of speaker identification. We also demonstrate that formants from difference spectrum provide com...

متن کامل

A Novel Approach for Text-Independent Speaker Identification Using Artificial Neural Network

This article presents the implementation of Text Independent Speaker Identification system. It involves two parts“Speech Signal Processing” and “Artificial Neural Network”. The speech signal processing uses Mel Frequency Cepstral Coefficients (MFCC) acquisition algorithm that extracts features from the speech signal, which are actually the vectors of coefficients. The backpropagation algorithm ...

متن کامل

Mel Frequency Cepstral Coefficients for Speaker Recognition Using Gaussian Mixture Model-Artificial Neural Network Model

Speaker Recognition (SP) is a topic of great significance in areas of intelligent and security. In Biometric SP using automated method of verifying or recognizing the identity of the person on the basis of some application, such as a finger print or face pattern and human voice. Many method have been proposed in the literature are focusing on front end processing such as PLP and LPC. In this pa...

متن کامل

Score Level Fusion Based Personal Authentication Using Fingerprint and Speech

In this paper development of a multimodal based biometric fusion system is discussed. A fingerprint recognition system is developed using global singularity features. Mel-frequency Cepstral Coefficients are used to recognise a speaker using the backpropagation artificial neural network. A score level fusion based recognition system is developed using fingerprint and speech match scores and the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005